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PRUNED  ERROR-TRELLIS  DECODING 
OF  CERTAIN  NON- SYSTEMATIC  CONVOLUTIONAL  CODES 


I.  S.  Reed 


I.  INTRODUCTION 

The  previous  works  on  syndrome  decoding  of  convolutional  codes  (CCs), 
e.g.,  see  Ref.  1,  led  to  what  is  called  error-trellis  decoding  [2],  Since 
the  ending  of  the  previous  NAVAIR  contract  (Contract  Number  N00019-83-C- 
0075)  in  March  1984  and  the  beginning  of  the  present  contract,  considerable 
progress  has  been  made  in  the  concept  of  error-trellis  decoding.  It  was 
found,  using  the  fact  that  CCs  are  capable  of  correcting  only  t  errors  in 
some  multiple  of  the  constraint  length,  that  the  new  algorithm  requires 
generally  only  a  reduced  or  "pruned"  trellis.  In  other  words,  it  was  shown 
that  the  finite  error-correcting  capability  of  the  CC  makes  possible  a  ponr 
ing  of  sometimes  a  substantial  number  of  states  or  paths  in  a  constraint 
length  of  the  error  trellis. 

Quantitative  formulas  for  the  extent  of  the  primary  process  for  all 
systematic  CCs  and  certain  non-systematic  CCs  were  found  recently  [3]  in 
collaboration  with  J.  M.  Jensen,  a  visiting  scholar  at  the  University  of 
Southern  California.  In  this  report,  the  underlying  theory  and  derivation 
of  these  results  are  given  and  applied  to  the  special  case,  the  dual-K  CCs. 
The  non-systematic  dual-K  CCs  were  first  introduced  by  Viterbi  and  Oden- 
walder  [4].  Such  a  code  is  non-binary  and,  for  some  applications,  a  possi¬ 
ble  substitute  for  a  Reed-Sol omon  block  code. 


It  will  be  shown  that  a  CC  with  a  pruned  error  trellis  can  be  decoded 
using  a  modificat n  of  either  the  Viterbi  or  sequential  decoding  algorithm 
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with  often  a  significant  reduction  in  complexity.  It  is  expected  that  de¬ 
coding  with  a  pruned  error  trellis  may  make  it  possible  to  decode  CCs— and, 
in  particular,  the  high-rate  CCs— with  a  larger  error-correcting  capability 
than  heretofore  considered  practical. 

In  order  that  this  report  may  stand  alone,  the  properties  of  the  CCs 
required  in  the  report  are  reviewed  briefly  in  this  introductory  section. 

In  Section  II,  the  general  techniques  developed  previously  [2]  for  error- 
trellis  decoding  are  summarized  and  a  set,  E,  of  error  sequences  which  can 
be  decoded  by  error-trellis  decoding  is  defined.  In  Section  III,  a  speci¬ 
fic  procedure  is  found  for  pruning  the  error  trellis  of  a  certain  class  of 
non-systematic  CCs,  which  includes  all  systematic  CCs  and  the  dual-K  CC. 
Formulas  for  the  number  of  resulting  states  and  transitions  after  pruning 
are  found  for  any  CC  with  free  distance,  dfree,  of  this  sub-class  of  CCs. 
The  results  are  applied  to  the  dual-K  CC  in  Section  IV. 

Let  the  information  or  message  sequence,  the  input  to  the  CC,  be  repre^ 
sented  by 

i<D>ah(D) . *k(d!  • 

where 


x 


i 


OO 


0=0 


for  1  <  j  <  k  are  elements  in  F[D],  the  ring  of  polynomials  in  the  unit  de¬ 
lay  operator  D  over  F  =  GF(q),  a  Galois  field,  with  q  a  power  of  a  prime 
integer.  Vector  2< ( 0 )  is  a  generating  function  in  D  of  the  input  message 
sequence  x  =  [xq,  •••»  2Lj»  •••]  »  where  s  [xji»  •••»  xj|<]  1S  a  vector 
belonging  to  V^F),  the  k-dimensional  vector  space  over  F.  ,x(D)is  sometimes 


called  a  D-transform  of  the  message  or  information  sequence  x.  The  k  com¬ 
ponent  vector  x.  in  x  is  called  the  information  frame  at  stage  or  frame 

J 

time  j. 

In  a  similar  manner,  the  output  sequence  is 

z(°)  -  h(D) . yn(D)]  * 

where  y^D)  e  F[D].  Vector  y(D)  is  the  D-transform  of  output  coded  sequence 
l  =  [jf0»  •••»  ...].  where  =  [y^,  ....  yjk]  belongs  to  V0(f).  The 

n-vector  v.  is  called  the  j-th  codeword  frame  of  code  sequence 

J 

The  information  and  code  sequences  of  an  (n,  k)  convolutional  code 
are  linearly  related  by  a  k  x  n,  rank  k,  generator  matrix  G(D)  of  polynomial 
elements  in  F[D],  as  follows: 

*(D)  =  x(D)  G(D)  . 

The  maximum  degree  m  of  the  polynomial  elements  of  G(D)  in  D  is  called  the 
memory ,  and  the  constraint  length  Lis  defined  as  L  *  m  +  1 . 

The  free  distance  of  a  CC  is  defined  by 

min  .  v 

dfree  =  *(D)  ±  0  W  ’ 


where  WH(y(D)j  is  the  cumulative  Hamming  weight  of  the  coefficients  of 
DJ  for  all  j,  0  <  j,  where  v.  is  the  j-th  codeword  frame.  Note  that  the 

J 

computation  of  d^ree  requires  at  least  L  codeword  frames  for  all  codes  of 
practical  interest. 

To  avoid  catastrophic  error  propagation,  G(D)  is  assumed  to  be  a  basic 
encoder  [5],  The  Smith  normal  form  of  a  basic  encoder  [2]  is 

G(D)  =  A(D)  [lk,  o]  B(D)  ,  (5) 


W*l  IWUU 
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where  A(D)  and  B(D)  are,  respectively,  k  x  k  and  n  x  n  invertible  matrices 
over  F[D]  and  Ik  is  a  k  x  k  identity  matrix. 

In  Eq.  (4),  let  matrix  B[D]  be  partitioned  as 

B(0)  =  [b,(d)T,  B2(0)Tf  , 

where  E] (D)consists  of  the  first  k  rows  of  B(D)  and  "T"  denotes  matrix  trans¬ 
pose.  Similarly,  let 

B(O)’1  -[b,(d),B2(d)]  , 

where  B^D)  consists  of  the  first  k  columns  of  B(d)-1.  Since  b(d)  •  B(d)-1  = 
In,  the  following  identities  evidently  hold: 


8i (o)  Bl(B)  -  Ik 

•  Bi(D)  ’  M°) 

b2(°)  .  5,(0)  -  o 

.  b2(d)  .  b2(o) 

A  parity-check  matrix  H(D)  is  an  (n  -  k)  x  n  matrix  of  rank  (n  -  k), 
satisfying 


G(D)  •  HT(D)  =  0  .  (7) 

From  Eqs.  (5),  (6),  and  (7),  it  is  seen  next  that 

H(D)  =  82(d)T  (8) 

has  the  properties  of  a  parity-check  matrix  H(D)  associated  with  G(D). 

By  Eq.  (3),  the  CC  generated  by  G(D)  is  the  set 

c  ■  (*(°>  -  [y,(0) . y„(D)]  |  i(D)  =  x(D)  G(D)  |  .  (9) 

It  is  now  shown  also  that 

c  =  {z(0)  =  [y^o).  ....  yn(o)]  |  y(o)  ht(d)  =  o  J ,  (10) 


where  H(D)  is  given  in  Eq.  (8).  To  see  this,  denote  the  right  side  of 
Eq.  (10)  by  C^.  Clearly,  an  element  of  C,  as  given  in  Eq.  (9),  belongs  to 
CH ,  and  hence  C  £  . 

Next,  suppose  y.-|(D)  is  an  element  of  C^,  i.e.,  by  Eqs.  (8)  and  (10), 
Xl(0)  HT(D)  =2,(0)  8,(0)  =  0  . 

But,  by  definition,  B~(d)  consists  of  the  last  (n  -  k)  columns  of  b(d)-', 
so  that 

ro 

B2(d)  =  b(o)'1 

.'n-k. 

where  "0"  denotes  a  block  of  k  rows  of  zeros  and  I  ^  is  the  (n  -  k)  row 
identity  matrix.  Thus,  y-|(D)  satisfies  the  equation 

ro  ' 

2,(o)b-1(d)  =0. 

.'n-k 

The  most  general  solution  of  this  equation  for  2,(d)b  '(d)  is 

2,(0)  B"'(D)  =  [r , ( 0 ) . rk(0),  0 . o]  =  [j(o) ,  o]  , 

where  t.(d)  for  1  <  j  <  k  can  be  chosen  to  be  any  arbitrary  elements  of  F[D] 
Solving  for  ^ ( D )  yields,  finally,  by  Eq.  (5), 

Z7(d)  =  r(0)  [lk,  0]  B(D)  =  t(d)  A_1(D)  G(D)  , 

which  belongs  to  C,  as  given  in  Eq.  (9).  Thus,  CH  £  C  and  Eq.  (10)  is 
proved. 

The  fact  that  the  CC  given  by  set  C  in  Eq.  (9)  can  be  characterized 
by  Eq.  (10)  is  used  in  the  following  section  to  find  the  coset  of  solutions 
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to  the  syndrome  equation.  It  is  also  used  to  find  another  representation 

of  this  coset  needed  in  error-trellis  decoding.  j- 


II.  ERROR-TRELLIS  DECODING 

Let  y(D)  in  Eq.  (3)  be  transmitted  and  zjD)  be  received.  Then, 

z(D)  =  ^(D)  +  e(D)  ,  (12) 

where  ejD)  is  the  D-transform  of  the  error  sequence.  By  Eqs.  (12)  and  (7), 
the  syndrome  of  the  received  sequence  is 


s(D)  =  z(D)  •  HT(d)  =  [y_(D)  +  _e(D)]  •  HT(d) 

=  e(D)  •  HT(D)  . 

The  problem  of  syndrome  decoding  is,  given  ^(D)  -  z(D) 
solve  the  syndrome  equation 


HT(  D) ,  to 


(13) 


s(D)  «z(D)  HT(D)  =  e(D)  HT(d)  ,  (14a) 

or  its  equivalent, 

(e(D)  -  z(D))HT(D)  =  0  ,  (14b) 

for  all  solutions  e(D). 

By  Eqs.  (10)  and  (9),  the  term  |<j(D)  -  zjD)j  in  Eq.  (14b)  must  be  some 
code  sequence  v_(D)  G(D).  Hence,  the  most  general  solution  of  the  syndrome 
equation,  Eq.  (14a),  is 

e(D)  =  z(D)  +  v(D)  G(D)  ,  (15) 


where  v(D)  is 


Equation 
equation,  Eq. 


the  D-transform  of  an  arbitary  message-like  sequence  v^  = 

. . . j  of  k-vectors  v^  €  Vk  (F) . 

(15)  shows  that  the  most  general  solution  of  the  syndrome 
(14a),  for  e(D)  is  the  coset 


ji 
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Cz  =  je(D)  =  z.(D)  +  v ( D )  G(D)  |  v(D)  =  [v^d) . vJd)]}  (16) 

of  code  C,  defined  by  either  Eq.  (9)  or  Eq.  (10).  A  minimization  of  the 
Hamming  weights  over  all  elements  of  coset  Cz  yields  the  standard  minimum- 
error  solution  for  message  vjD).  Efficient  methods  for  achieving  this 
minimization  include  the  Viterbi  algorithm  and  all  sequential  decoding 
methods  for  convolutional  codes. 

The  difficulty  with  the  standard  decoding  methods  of  CCs,  i.e.,  Viterbi 
or  sequential  decoding,  is  the  need  to  consider  a  sometimes  prohibitively 
large  number  of  states  and  paths  in  the  decoding  trellis.  Such  minimum- 
weight,  path-finding  decoding  methods  do  not  take  advantage  of  the  limited 
error-correcting  capability  which  one  might  expect  could  reduce  the  number 
of  paths  in  the  trellis  over  which  this  minimum  is  taken.  One  method  which 
allows  for  such  a  reduction  in  the  number  of  paths  in  the  trellis  is  to  use 
another  equivalent  solution  of  the  syndrome  equation,  Eq.  (14),  which  is 
independent  of  the  transmitted  codeword. 

Another  solution  of  Eq.  (14)  is  given  by 

e(D)  =  u(D)  G(D)  +  z(D)  R(D)  ,  (17a) 

where 

R(D)  =  B2(d)  B2(d)  (17b) 

is  an  n  x  n  matrix  of  rank  (n  -  k)  and  where  matrices  B2(d),  B2( d)  are  de¬ 
fined  in  Eq.  (6).  To  prove  Eq.  (17)  is  a  general  solution  of  the  syndrome 
equation,  note,  by  Eqs.  (17b)  and  (6),  that 

(z(D)  -  z_(D)  R(D)|  HT  =  z(D)  ^(d)  -  _z(D)  B2(d)  B2(D)  ?2(d) 

=  z(D)  B2(d)  -  z(D)  B2(d)  In_k  =  0  . 


b 
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Hence,  by  Eq.  (10),  zjD)  -  zjD)  R(D)  is  a  possible  codeword,  say 
x_i(d)  G(D),  where  ><i(d)  is  some  k-vector  of  elements  in  F[D] .  That  is, 

2(D)  -  z(D)  R(D)  =  x-|  (D)  6(D) 

or 

z(D)  =  2<-j (o)  6(D)  +  z(D)  R(D) 

for  some  information  vector  x^(D).  A  substitution  of  zjD)  in  the  above 
equation  into  Eq.  (15)  yields 

e  =  ^(D)  +  v(D)Je(D)  +  z(D)  R(D) 

=  u(d)  G(D)  +  z(D)  R(D) 

as  another  general  solution  of  the  syndrome  equation,  where  u(D)  is  an  arbi¬ 
trary  k-vector  of  elements  in  F[D].  Hence,  Eq.  (17)  is  established. 

The  solution,  Eq.  (17),  of  the  syndrome  equation,  Eq.  (15),  has  the 
desirable  property  that  it  is  independent  of  the  transmitted  codeword  y(D). 
To  see  this,  letjea(D)  replace  e^(D)  in  Eq.  (17)  as  the  actual  error  sequence 
Then,  a  substitution  of  Eq.  (12)  into  Eq.  (17)  yields,  by  Eq.  (10), 

e(D)  =  u(D)  G(D)  +  (y(D)  +ea(o))  HT(d)  Bg (d) 

=  u(D)  G(D)  +  ea(D)  R(D)  , 

which  is  independent  of  the  transmitted  message  y(D). 

By  the  maximum  likelihood  principle,  the  most  likely  error  sequence 
e(D)  is  the  one  with  minimum  Hamming  weight.  That  is, 

!  \  min 

WH  e(D) 


with 


Wh(u(D)  G(D)  +  z(D)  R(D) ) 
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e(D)  =  u(D)  6(D)  +  z(D)  R(D)  ,  (18b) 

where  the  minimization  is  taken  over  all  k-vectors  ^(D)  over  F[D]  and  jj(D) 
is  some  k-vector  which  achieves  this  minimization. 

It  is  seen,  from  Eq.  (17),  that  all  possible  error  sequences  are  ob¬ 
tained  by  adding  the  sequence  z^(D)  R(D)  to  the  code  tree  or  trellis  C  in 
Eq.  (9),  presented  graphically.  This  new  trellis  is  called  an  error  trellis. 
Thus,  the  most  likely  error  sequence  e(D)  in  Eq.  (18)  equals  also  the  mini¬ 
mum  weight  path  in  the  error  trellis.  Hence,  by  Eq.  (12),  a  subtraction  of 
ejD)  from  _z(D)  produces  the  best  estimate  £(D)  =  ejD)  -  z(D)  of  the  original 
transmitted  codeword  yjD).  Therefore,  the  best  estimate  x(D)  of  the  original 
information  sequence  x.(D)  is 

x(D)  =  £(D)  G(D)'1 

=  [z(D)  -  0(D)  G(D)  -  z(D)  R(D)]  G(D)_1  (19) 

=  z(D)  G(D)"1  -  0(D)  . 

The  last  inequality  follows,  using  Eqs.  (5),  (17b),  and  (6),  from 


R(D)  G(D)-1  =  B2(d)  B2(d)  B-1 (D) 


-  b2(°)  b2(d)  [bi(°)>  b2(°)] 


A-1 ( D) 


0  . 


This  important  identity  shows  that  0(D),  obtained  in  the  minimization  in 
Eq.  (18),  is  the  "best"  correction  factor  for  the  received  information 
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sequence,  zjD)  G"^  (D) . 

Again,  1 et  _§a( D j  be  the  actual  error  sequence.  Then,  a  substitution 
of  z^(D)  =  y(D)  +  J3(d)  into  Eq.  (19)  yields 

x(D)  =  [j/(D)  +  ea(D)]  G_1(d)  -  0(D) 

=  x(D)  +  ea(D)  G_1( d)  -  0(D)  , 


(20) 


where  x(D)  is  the  original  information  or  message  sequence  in  Eq.  (1). 

Now  ,  define  E  to  be  the  set  of  all  error  sequences  which  can  be 
corrected  by  Viterbi  or  error-trellis  decoding,  as  in  Eq.  (18).  Then,  by 
Eq.  (20),  if  _e  (d)  «  E,  the  estimate  ejD)  found  by  the  minimization  in 
Eq.  (18)  equals _ea( D ) ,  so  that  u^D)  =  ea(D)  G~^(d).  Thus,  the  minimization 
in  Eq.  (18)  needs  only  be  taken  over  those  sequences  u/D)  which  belong  to 
the  set 


(-D 

E 


e(D)  €  E 


)• 


Hence,  the  most  likely  error  sequence  e(D)  is  found  by 


(21) 


min 

u(D)  €  E(_1 5 


Wh|u(D)  G(D)  +  z(D)  R(D) 


(22a) 


with 

e(D)  =  0(D)  G(D)  +  z(D)  R(D)  ,  (22b) 

where  E^"^  is  the  set  of  k-vectors,  defined  in  Eq.  (21). 

Note  that,  if  e,(D)  €  E,  then  the  most  likely  sequence  found  by  Eq.  (22) 

a 

is  equal  to _§a(D),  the  actual  error  sequence.  If  Ja(D)  <  E,  a  decoding  error 
will  be  made  in  both  Eq.  (18)  and  Eq.  (22). 

In  order  to  actually  perform  the  minimization  in  Eq.  (22)  over  the  set 
E^"^,  the  sets  E  and  E^'^ 


must  be  identified.  This  is  generally  impossible. 


However,  as  will  be  shown  in  the  following  section  for  a  certain  restricted 
class  of  non-systematic  CCs,  a  reasonable  approximation  to  sets  E  and 


can  be  found. 


III.  ERROR-TRELLIS  DECODING  OF  CERTAIN  NON-SYSTEMATIC  CCs 

To  find  a  reasonable  approximation  to  E^"^  in  Eqs.  (21)  and  (22),  the 
set  in  which  the  most  likely  sequence  u(D)  can  be  found  for  message  correc¬ 
tion,  consider  first  the  error  trellis  in  more  detail.  By  Eq.  (17),  the 
error  trellis  is  a  coding  trellis  generated  by  the  term  j£(D)  G(D)  plus 
the  term  _z(D)  R(D)"\  which  depends  on  the  received  coded  sequence. 

Thus,  the  underlying  coding  trellis  of  the  error  trellis  is 

w(D)  =  e(D)  -  z(D)  R(D)  =  u(D)  G(D)  , 
which  can  be  conceived  to  be  a  linear  sequential  circuit  with  input  u-  at 

J 

frame  j,  where  u_j  =  j^u^  ,  ....  u^j  is  the  j-th,  k-vector,  coefficient  of 
the  formal  power  series  for  u^D)-,  i.e. 


u(D)  =  ^  Uj  Dj  . 
j=0 

By  expanding  u/D),  G(D),  and  W(D)  into  their  formal  power  series  and  equat¬ 


ing  coefficients. 


* f  (V  W' 


for  j  >  0  is  the  output  of  the  coding  trellis  at  frame  j  in  terms  of  input 
u^  and  the  m  previous  values  of  u  - ,  where  initially  u.  =  0  for  all  j<  0. 
Evidently,  function  F  is  linear  in  each  of  its  k-vector  components  so  that 
Wq  =  0,  the  zero  n-vector. 
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where 

Gq  =  [l,  1,  1,  ....  l]  and 

G1  =  [911’  912 ’  9ln]  ' 

with  g^^O  and  g^.  e  Gf(2K)  for  (j  =  1,  2,  ....  n). 

From  the  above  definition  of  a  dual-K  CC,  the  minimum  distance  of  the 
code  is  d  =  2n.  Evidently  also,  the  free  distance  is 

“free  =  d  =  2"  • 

Hence,  if  no  more  than  t  symbol  errors  occur  in  the  first  two  codeword 
frames,  and 

2t  +  1  <  d  =  2n  or 
t  <  n  -  1/2  , 

then  those  errors  which  occur  in  the  first  frame  can  be  corrected.  In  other 
words,  the  dual-K  CC  is  a  t-error-per-constraint  length-correcting  CC,  where 
t  =  [n  -  1/2]. 

Consider  now  the  example  given  by  Odenwalter  [4,  Fig.  1]. 

Example:  Although  he  does  not  say  so  explicitly,  Odenwalder  uses  the 

Galois  field  GF  (2G) generated  by  the  polynomial  xG  +  x^  +  1,  irreducible 

i  -i\  2  3  4  5 

over  GF(2  ).  If  a  is  a  root  of  this  polynomial,  then  a,  a  ,  a  ,  a  ,  a  , 

aG,  =  1 ,  and  0  are  the  eight  elements  of  Gf(2G).  The  representation  of 

the  elements  as  polynomials  in  a  is  given  in  Table  1. 

The  convolutional  encoder  of  a  rate  1/2,  dual-3,  CC  is  shown  in  Fig.  1. 


ri 


This  is  a  more  abstract  version  of  the  same  encoder  given  by  Odenwalter 
[4,  Fig.  1]. 
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i  r  ik.  -  q(d) 

B(D)  1  =  k 

0  •  !n-k 

and,  as  a  consequence,  G( D) is  delay  free  and  of  form,  Eq.  (47),  so  that 

0=1. 

The  number  of  states  and  transitions  derived  above  in  theorem  2  can  be 
shown  to  be  polynomial  in  mk.  In  fact,  if  mk  »  t,  then  it  can  be  proved 
that 

S(q ,  k,  m,0t)=  £(q  -  1 )  e  mk/0tj0t 
and 

T(q ,  k,  m,  0t)  =  |^(q  -  1 )  e  (m  +  1 )  k/0tj0t  , 

are  the  number  of  states  and  transitions  per  frame  time  required  for  error 
trellis  decoding  of  a  systematic  CC.  For  standard  Viterbi  decoding,  these 
numbers  are  exponential  of  form  q111^  and  q^m+^,  respectively.  These  re¬ 
sults  show  that  the  efficiency  of  erro. -trellis  decoding  improves  dramati¬ 
cally  both  with  increased  rate  and  mk  over  standard  Viterbi  decoding. 

IV.  PRUNED  ERROR-TRELLIS  DECODING  OF  DUAL-K  CONVOLUTIONAL  CODES 

Dual-K  convolutional  codes  are  (n,  1)  CCs  of  rate  1/n,  of  memory  m  =  1, 
and  with  symbols  in  the  finite  or  Galois  field  Gf(2K).  See  Odenwalder's 
paper  [4].  The  generating  matrix  G  is 
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Theorem  2  shows  that  non-systematic  (n,  k)  CCs  with  delay-free  inverse 
generating  matrices  allow  for  the  possibility  of  trellis  pruning.  In  gene¬ 
ral,  the  smallest  reduction  factor  is  obtained  for  such  codes  with  9  =  1  or 
inverse  matrices  of  form 


(47a) 


where 


0  ^  wh(^)  ^  1 »  for  k  +  1  <  i  <  n  . 


(47b) 


Three  classes  of  codes  achieve  9=  1  for  greatest  error  trellis  pruning. 
They  are: 

(i)  All  (n,  k)  systematic  CCs. 

(ii)  All  (n,  1)  non-systematic  CCs  with  delay-free  inverse  matrices. 

(iii)  All  (n,  k)  non-systematic  CCs  with  delay  free  inverse  matrices, 
where  Eq.  (47)  is  true. 

All  other  CCs  with  a  delay-free  inverse  generating  matrix  require  1  <  9  <  k. 
To  show  (i),  above,  observe  that  all  (n,  k)  systematic  CCs  have  a  generating 
matrix  of  form  G(D)  =  £lk>  Q(D)J,  and,  as  a  consequence,  a  Smith  normal  form, 
G(D)  =  [lk,  0 ]  B(D) ,  where 


B(D)  = 


Ik,  Q(D) 
0  ,  I„_u 


Hence,  G(D)-1  =  B ( D ) ” 1  [lR,  o]T  =  [lR>  ojT,  sii 
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states  S.  need  to  remain  in  the  error  trellis  which  have  Hamming  weight  of 

3  /m  k\  .  i 

at  most  9 t.  Since  there  are  exactly  I  .  I  q-1  states  S.  of  weight  j 

/  n  \'  J  ' 

for  0  <  j  <  min  (£?t,  m  k),  where  l  ^  I  denotes  the  binomial  coefficient,  the 
first  part  of  the  theorem  is  proved. 

Consider  now  the  "change  of  state"  equation,  Eq.  (25b),  of  the  coding 
trellis,  associated  with  the  error  trellis  in  Eq.  (17).  By  Eqs.  (25b), 

(28),  and  (30),  a  transition  in  the  error  trellis  from  state 

S j  =  ( al . -m  ) 

at  frame  j  yields,  upon  input  u- ,  the  next  state, 

J 

Sj+1  =(-,,<rl’  *  * ’  *  — m-1 )  ’ 

where  o-1  =  u.  is  the  value  of  the  input  u.  at  frame  j.  Thus,  the  transition 

J  J 

is  determined  uniquely  by  input  a  1  and  state  ,  . ...  )  or  by  the  (m  +  1) 

tuple, 

(^‘  *  ^m-l’  °m)  * 

But,  by  Eq.  (26),  this  (m  +  l)-tuple  or  its  equivalent,  the  transition,  is 
equal  to  (m  +  1)  =  L  consecutive  frames  of  the  sequence  jj(D),  namely, 

(  -  ’  2\  ’  *  ‘  ’ ’ -m)  =  (-j  ’  -j-1  *  *  ’  ’  ’  -j-m )  » 

where  =  0  for  j  <  0.  As  in  the  first  part  of  the  theorem,  only  those 
sequences  which  are  in  Zm  ^  need  to  be  considered.  Therefore,  a  branch¬ 
ing  or  transition  from  state  Sj  =  |^,  .  to  state  Sj+-|  =  (cr‘ ,  » 

•  ..,Zm_i)  needs  to  be  in  the  error  trellis  if  and  only  i f  ,  <T| ,  .... 

<  t.  Hence,  the  total  number  of  transitions  at  frame  j  is  given  by 
Eq.  (46)  for  j  >  m,  and  the  theorem  is  proved. 


If  a  probability  measure  were  imposed  on  the  set  of  all  possible  error 


sequences,  it  is  probable  that  set  E-j  would  approximate  E  very  closely  in 
probability.  Hence,  for  such  cases,  the  performance  of  a  Viterbi  decoder  to 
realize  Eq.  (44)  would  differ  very  little  from  the  hypothesized  generalized 
feedback  decoder. 

Realizing  Eq.  (44)  with  a  Viterbi-like  or  sequential  decoder  has  the 
advantage  over  standard  Viterbi  or  sequential  decoding  in  that  it  allows  for 
a  reduction  of  both  the  number  of  states  and  transitions  in  the  error  trellis 
The  following  theorem  quantifies  the  reductions  achieved  in  the  "pruned" 
trellis.  The  techniques  for  pruning  the  trellis  are  given  in  the  proofs  of 
the  theorem. 

Theorem  2.  The  number  of  states  in  the  pruned  error  trellis  of  a  q-ary 
(n,  k)  CC  with  memory  m  and  a  delay-free  G"1  is 


S(q ,  k,  m,  9t)  =  ^2 
j=0 


m  k  I  / 

.  MJ . 


where  a  =  min  (0t,  m  k)  and  t  =  Jjdfree  -  l)/2]-  Also,  for  the  same  CC,  the 
number  of  transitions  in  the  pruned  error  trellis  is 


T(q ,  k,  m,  9 t) 


where  0  =  min  j^t,  (m  +  1 )  kj. 

Proof:  The  minimization  in  Eq.  (44)  for  error-trellis  decoding  is 
taken  only  over  those  sequences  jj( D )  which  belong  to  g^.  That  is,  by 
Eq.  (40),  the  minimization  is  taken  only  over  those  sequences  jj(D)  which 
have  Hamming  weight  of,  at  most,  9 t  in  every  L  =  m  +  1  consecutive  frames, 
i  .e. ,  WH(Uj  ,  ....  u j+m  )  <  9t  for  all  j  >0.  Hence,  by  Eq.  (30) ,  only  those 


For  the  other  part,  assume  jj( D)  e  2_  Then,  v^(D)  = 

^u(D),  0,  . ..,  oje  2m*  But  this  implies  that 

v(D)  •  G”1  =  [u(D),  0,  ....  °]  [  p  j  =  u(D)«  2m(_1)  . 

Hence,  2  2  and  the  theorem  is  proved. 

Ill  y  I  f! 

Assume  for  the  moment  that  the  minimization  in  Eq.  (22)  is  accomplished 
by  generalized  feedback  decoding,  then,  by  Eqs.  (22)  and  (35),  the  most 
likely  error  sequence  is  found  by 


/  \  mm  .  \ 

WH(e(i D)|  =  ^  ^  (_1}  W^u(D)  3(D)  +  z(D)  R(D)  j  (43a) 

J 

with 

e(D)  =  u(D)  G(D)  +  z(D)  R(D)  .  (43b) 

But,  in  Eq.  (43a),  the  same  estimate  of  e(D)  is  obtained  if  the  minimization 

is  taken  over  any  set  which  includes  2.^"^.  Hence,  since  2,~^  E.  2  by 

-  J  J  m 

Eq.  (37),  and  2^  c  2^  ^  ,  it  is  sufficient  to  take  the  minimization  in 
Eq.  (43a)  only  over  2m  .  Thus,  the  minimum  error  sequence  e(D)  in  Eq. 

in  j  a 

(43a)  can  be  found,  for  all  convolutional  codes  for  which  G“^(D)  is  delay 
free,  by 


min,  ' 

_i  W  u(D)  G(D)  +  z(D)  R(D) 


(44a) 


with 

e(D)  =  u(D)  G(D)  +  z(D)  R(D)  , 
where  u( D)  is  a  "best"  message  correction  factor. 


(44b) 


-19- 


■  « 


+  ... 


WH^j  *  *  *  * »  — j+m )  -  WH^j ( k)  »  *  ’  -j+m(k)]  ’ 

[vj  ,k+l  * -k+l . vj+m,k+l  *  — k+l]  • 

[Vj >n  *  — n*  vj+m,n  * -n]) 

=  Wh(V  -j+m^)  +  Wh(vj ,k+l  ’  ^k+l)  +  +  WH(vj+m,k+l  ’ -k+l)  + 

+  Wn(vj,n  ’  -n)  +  •  •  •  +  (v j+m *  -Pn) 

=  WH^j(k)’  •"»  -j+m(k))  +  Wh(— k+l)  Wh(vj  ,k+l)  +  •**  +  WH^vj+m,k+l)J  +  *’ 

+  WH^n)  WH(vj,n)+  ”•  +WH(vj+n,,n)] 

<W  /v  W  v  ^  L/p\lfw/v  W 

-  Hy— j  *  )  k+1<  i  <  n  |  [WH\Vj,k+l] 

+  WH|vj+m,k+l)  +  •••  +  Wh(vj  ,n)  +  •••  +  WH(vj+m,n)] 

-  e[WH^j(k)”"  ^j+m(l<))  +  Wh(Vj  ,k+l)  +  *•'  +  WH(vj+m,k+l)  +  ”• 

+  WH(vj,n)+ +  WH(vj,n)]* 

where  Q  is  given  in  Eq.  (41).  After  assembling  these  components  of  vectors 
Xj*  •••>  Vj+m  within  the  Hamming  weight  function,  this  inequality  becomes, 
finally, 

Wh(— j ’  -j+m)  -  ®WH (-j*  -j+m) 

<  6  t  , 

where  the  last  inequality  follows,  from  Eq.  (33).  Hence,  by  Eq.  (40), 
ar,d  t*1e  ^irst  part  of  the  theorem  is  proved. 


+  ... 


Thus,  using  Eq.  (39)  in  lemma, 


But,  since  the  Hamming  weight  of  a  sum  of  vectors  is  upper-bounded  by  the 
Hamming  weight  of  the  concatenation  of  the  same  set  of  vectors,  one  has 


‘ikl 

p  where  1^  is  the  k  x  k  identity  matrix  and  P  is  an  (n  -  k)  x  k  matrix 
of  elements  in  GF(q). 

By  the  above  lemma,  there  is  no  loss  in  generality  to  assume  that  G-^ 
has  the  form 


where  _P  ■ 

J 


for  k  +  1  <  j  <  n. 


Now  define  a  message-correction  cylinder  by 


where 


k+1  <  i  <  n 


NUi)-  ’]■ 


Evidently,  1  <  Q  <  k.  The  following  theorem  now  relates  2  with  cylin¬ 
ders  2m  and  2  - 

m,l  m,a 

«l«>re  cylinder  y1  and 

for  1  <  9  <  k  are  defined  in  Eqs.  (36)  and  (40),  respectively. 

Proof:  First,  to  show  2^"^  £2^  suppose  u(D)  =  v/D)  G"^  e 

2  where  v(D)  e  2.  Then,  since  G"^  is  delay  free, 

m  —  m 


•••>  Remember,  by  Eq.  (36),  that  u_(D)  =  v/D)  G  (D).  Thus, 

in  the  special  case  that  G_1(D)  is  del  ay -free  of  form 

*  £i  =  [9il . 9ik]  »  ( 

where  G. .  «  GF(q),  one  has 

*  J 

n 

*  £  VJ1  2,  • 

i=l 

so  that  the  k-vectors  u.  are  linear  functions  of  the  components  of  for 
(j  =0,  1,  ...,  n).  Hence,  a  CC  for  which  G"^  is  del ay -free  has  the  property 
that  the  states  of  the  error  trellis  are  dependent  only  on  m  +  1  successive 
frames,  v.  v v.  _,  of  error  vectors.  Therefore,  in  this  case,  by 

J  »  J  “  I  J  “Hi 

Eq.  (33),  one  needs  only  consider  the  error  cylinder  I  and  its  correspondent 
2  when  endeavoring  to  prune  states  from  the  error  trellis. 

Assume  now  that  G-^  is  del  ay -free,  and  that  G-^  is  an  n  x  k  matrix  over 
GF(q) .  Since  a  code  sequence  ^(D)  =  ^y-j(D),  ...,  y n ( D ) J  has  the  same  dis¬ 
tance  properties  as  a  code  (D)  =  [y-j '  (D) ,  ....  yn'  (D)] ,  where  y j '  (D)  * 
y7r(j)(D)>  where  ir{ j)  denotes  a  permutation  of  the  integers  (j  =  1,  ...,  n), 
it  is  natural  to  call  code  y/ (D)  permutation  equivalent  to  y(D). 

Definition  1.  Two  codes  are  permutation  equivalent  if  one  can  be 
obtained  from  the  other  by  the  same  permutation  of  places  or  coordinates 
in  all  frames.  The  following  lemma  can  be  established  for  permutation 
equivalent  CCs. 

Lemma:  Let  C-|  be  a  CC  with  a  delay- free  inverse  G"^ .  Then  C^  is 
permutation  equivalent  to  a  code  C?  with  a  delay- free  inverse  of  form 


However,  suppose  the  error  trellis  or  its  equivalent,  a  Viterbi  de¬ 
coder,  is  restricted  to  be  a  generalized  feedback  decoder  wherein  the 
correction  of  the  t-th  frame  is  decision  dependent  on  only  the  z-th  and 
the  preceding  J  frames  of  data,  namely  (z^,  ...»  z^_j j  for  (fc  ■  0,  1,  ...). 
Then,  by  construction,  such  a  generalized  feedback  decoder  has  the  property 
that  it  will  correct  any  error  sequence  belonging  to  Xj.  Hence,  if  set 
E-|  is  defined  to  be  the  set  of  all  error  sequences  which  can  be  corrected 
by  a  generalized  feedback  decoder,  using  error  trellis  decoding  or  its  equiva 
lent,  Viterbi  decoding,  with  correction  delayed  by  J  frames,  then 


Evidentl’  ,  set  E-j  is  a  reasonable  approximation  to  set  E. 

The  restriction  of  set  E  to  its  approximation  E-|  makes  it  possible,  by 
the  definition  of  a  state  in  Eq.  (30),  to  often  reduce  the  number  of  states 
as  well  as  transitions  needed  for  the  error  trellis.  To  show  this,  define 
first,  by  analogy  with  set  E^-1^  in  Eq.  (21),  the  sets 

-  {  u(D)  =  w(D)  G-’(D)  |  v(D)  €  Zj)  ,  ( 

corresponding  to  X.  in  Eq.  (33)  for  0  <  j  <  J.  These  sets,  associated  with 

J 

the  cylinders  X.,  also  form  a  family  of  non-increasing  sets,  i.e., 

J 

y  (-l)r>T  ("1)  z>  v  (-1)  -  c  (■!)  ( 

To  prove  this,  suppose  jj(D)  e  X  where  m  <  j  <  r  <  J.  Then,  by 

Eqs.  (36)  and  (34) ,  ( D)  =  vjD)  G-^  (D) ,  where  _v (D)  c  X  c  j  for  m  <  j  <  r. 

*  J 

Hence,  by  Eq.  (36),  u(D)  €  so  that  X.  2.  X  and  Eq.  (37)  is  estab- 

J  vJ  * 

lished. 

By  Eq.  (30),  the  state  of  the  error  trellis  depends  only  on  one  con¬ 
straint  length  or  L  =  m  +  1  successive  values  of _uit  namely. 


For  most  codes  of  practical  interest,  J  >  m. 

Assume,  as  in  Eq.  (12),  that  ejD)  is  a  possible  error  sequence  and 
define 

=  {  e(D)  |  WH  (e£,  ....  e£+J.)  <  t  for  all  i  >  0  J  ,  (33) 

where  t  =  £(dfree  -  1 )/ 2 J  and  [a]  denote  the  largest  integer  less  than  or 
equal  to  a.  Sets  Z.  for  (j  =  m,  m  +  1 ,  J)  constitute  a  family  of  non- 

J 

increasing  sets,  i.e.. 


2  ^  2  ^  => 
m—  m+1  — 


The  smallest  set  Zj  is  the  analogue  of  the  classical  error-correcting  sphere 
in  block  codes.  However,  since  the  elements  of  Z^  are  infinite  sequences, 
it  is  perhaps  better  to  call  Zj  an  error-correction  cylinder  rather  than 
sphere.  In  fact,  a  set  £.,  as  defined  in  Eq.  (33),  is  precisely  the  infinite 

J 

intersection  of  what  usually  are  called  cylinder  sets  of  form 

=  {e(D)  WH  (  e£,  . . . ,  e£+J.)  <  t  j 

for  ( £  =  0 ,  1,  2,  ...). 

For  a  linear  block  code,  the  set  of  all  error  vectors  which  are  correct¬ 
able  unambiguously  by  minimum  distance  decoding  is  equal  to  the  error- 
correction  sphere.  As  a  consequence,  one  might  suspect  by  analogy  for  con¬ 
volutional  codes  that  E,  the  set  of  all  error  error  sequences  which  can  be 
corrected  by  error-trellis  decoding  in  Eq.  (22)  or  its  equivalent,  Viterbi 
decoding,  would  equal  Zj.  However,  the  methods  of  Viterbi  or  sequential 
decoding  require  the  processing  of  the  entire  sequence  before  a  final  deci¬ 
sion  is  made.  Hence,  it  is  suspected  that  there  are  sequences  correctable 
by  standard  Viterbi  decoding  which  are  not  in  Zj,  and  possibly  vice  versa. 


-j  '  -j +  k 


=  F(V  sj)+ij 

in  terms  of  inputs  u.  and  r.  and  internal  state  S-,  where  initially  jj.  =  0 

J  J  J  J 

for  j  <  0  and  Sg  =  0. 

The  states  S.  of  the  error  trellis,  defined  in  Eqs.  (24)  and  (25b),  for 

J 

an  (n,  k)  CC  of  memory  m  are  elements  of  the  set  of  m-tuples  of  k-vectors  as 


follows: 


n  =  {(g^,  ....  am)  j  «  Vk  (F)1  1  <  i  <  m  j.  ( 

By  Eq.  (26),  a  possible  state  Sj  =  (o\j,  .  in  il  is  equal  to  the  m-tuple 

of  m  consecutive  past  frames  of  the  input  u..  That  is, 

Sj=(-1 . -m)  =  (— j-1 . -j-m)  •  ^ 

where  u.  =  0  for  j  <  0. 

J 

In  convolutional  codes,  the  j-th  column  distance,  denoted  by  d.,  is  de- 

J 

fined  [6]  as  the  minimum  Hamming  weight  of  the  first  (j  +  1)  codeword  frames, 

where  the  0-th  information  frame  is  non-zero.  That  is. 


i'n^0 


where  Wu(-)  is  the  Hamming  weight.  The  column  distance,  d.,  forms  a  non- 

decreasing  sequence  for  (j  =0,  1,  ...).  For  a  basic  encoder,  the  limit 

of  d .  is  reached  in  a  finite  number,  J,  and  equals  the  free  distance,  dj. 
J 

Thus , 

dj  <  dfree  for  j  <  J 
=  dr _  for  j  >  J  . 


#“■  r 
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Equation  (23)  can  be  expressed  in  standard  sequential  circuit  form 
by  defining  the  state  S.  of  the  circuit  at  frame  j  to  be 

J 

Sj  =  (-j-1 . — j-m  )  * 

the  m-tuple  of  the  previous  m  values  of  the  k-vector  u.  in  sequence.  Ex- 

J 

pressed  as  a  sequential  circuit,  where  initially  u .  =  0  for  j  <  0  and 

J 

SQ  =  0,  the  coding  trellis  in  Eq.  (23)  is  given  by 

=  F(v  sj) 

sj+i  * p  ( •  sj )  • 

where  "P"  denotes  the  projection  along  the  first  m  components  of  /  u . ,  S.  )  = 
(Uj.  Hj-1 . Uj.m)  as  follows:  ^ 

P  (-j*  Sj)  =  P  (%•  -j-1 . -j-m)  =  (-j . — j-m-1  )  4 

The  next  state  equation  of  the  encoder  sequential  circuit,  associated  with 
the  error  trellis,  is  Eq.  (25b)  and  its  output  equation  is  Eq.  (25a). 

Let  the  formal  power  series  for  z(D)  R(D)  and  e(D)  be 


(25a) 


(25b) 


2(D)  R(D)  =  XI  -j  °j 
j=0 


(27a) 


e(D)  =  V  e  Dj  . 

"  J 

j=0 


(27b) 


Then,  by  equating  coefficients  of  DJ  and  using  Eqs.  (25)  and  (27),  the  output 
label  at  frame  j  of  the  error  trellis  is  given  by  the  n-vector 


By  Fig.  1,  the  output  of  the  encoder,  in  terms  of  input,  is 


is  the  matrix  R(D)  needed  in  the  error  trellis  solution,  Eq.  (17a),  of  the 
syndrome  equation. 

It  is  easy  to  show  from  the  above  generating  matrix  G(D)  that  d^rge  ■ 

4  and  t  =  1.  Thus,  by  Theorem  2,  S(q,  k,  m,  t)  =  8  and  T(q,  k,  m,  tl)  s  15. 
Thus,  the  error  trellis  can  be  pruned,  as  shown  in  Fig.  2.  In  one  constraint 


length,  only  1L  transitions  are  needed  in  the  error  trellis,  whereas,  for 
the  standard  decoding  trellis,  64  transitions  are  required.  This  yields  a 
reduction  of  15/84  ~  1/4  in  the  number  of  transitions  needed  for  error- 
trellis  decoding  below  the  number  required  for  standard  Viterbi  hard  decod¬ 
ing. 

The  labels  on  the  pruned  error  trellis  shown  in  Fig.  2  correspond 
to  the  solution,  Eq.  (17a),  of  the  syndrome  equation  for  the  actual  error 
sequence  equal  to  the  all -zero  sequence.  That  is, 

e(D)  =  [er  e2]  =  u(D)  G  =  u(D)  [l  +  0,  1  +  a  d] 

=  [t  +  Dt,  t  +  a  Dt] 

are  the  output  of  the  trellis.  For  example,  at  frame  time  j  and  state  0,  if 
ujD)  =  a4,  then 

e^(D)  =  £a4  +  0,  a4  »  a  .  oj  =  [a4,  a4] 

is  the  label  on  transition  from  state  0  to  state  1.  Such  a  transition  repre¬ 
sents  an  attempt  to  "cancel"  a  single  error  in  the  error-trellis  equation, 

Eq.  (17a).  If  such  an  error  does,  in  fact,  occur  at  frame  j,  then  no  further 
errors  are  allowed  to  occur  at  frame  j  +  1.  Thus,  a  transition  to  other  than 
state  0  must  be  followed  by  a  transition  back  to.  state  0  in  the  next  frame, 
as  shown  in  Fig.  2. 

.  4  4 

Next,  suppose  a  transition  to  state  a  occurs,  i.e.,  Dt  =  a  .  Then, 

4 

since  u(D)  =  0,  the  transition  from  state  a  back  to  0  is  given  by 

e(D)  =  [o  +  a4,  0  ♦  a  .  a4]  .  [a4,  a5]  . 

The  remaining  labels  to  the  "pruned"  error  trellis  are  obtained  in  a  similar 


manner. 


To  illustrate  pruned  error-trellis  decoding  of  the  dual -3  CC,  let  the 


generating  function  of  message  of  information  sequence  be 

x(D)  =  1  +  a  D  . 

Then  the  codeword  sequence  is,  by  Eq.  (3), 

£(D)  =  x(D)  6(D)  =  [l  +  a5  D  +  a  D2,  1  +  a2  D2]  . 

Next,  let  the  actual  error  sequence  be 

ea(D)  -  [d2,  a] 

so  that 

z(D)  =  ^(D)  +  ea(D)  =  [l  +  a5  D  +  a5  D2,  a5  +  a2  D2]  . 

Hence,  by  Eq.  (17b), 

a2  +  a3  D,  a3  +  a4  D 
a2  +  a2  D,  a3  +  a3  D 

■  [a3  +  a3  0  +  a2  D2  +  a3  D3,  a4  +  a4  D  +  a3  D2  +  a4  D3] 

*  [<*3»  a4]  +  [a3,  a4]  D  +  [a2,  a3]  D2  +  [a3,  a4]  03  . 

The  finding  of  the  minimum-weight  error  path  £(D)  in  terms  of  u(D)  is 
equivalent,  by  Eq.  (17a),  to  finding  the  codeword  u/D)  G(D)  which  is  closest 
to  z(D)  R(D),  as  given  above.  Hence,  the  minimum-weight  error  path  can  be 
found  by  applying  the  Viterbi  decoding  algorithm  to  the  pruned  error 
trellis  in  Fig.  2.  To  accomplish  this,  the  frames  of  zjD)  R(D)  are  added  to 
the  outputs  u(D)  G(D)  in  the  pruned  error  trellis  in  Fig.  2,  as  shown  in 


z(D)  R(D) 


■t 


1  +  a  D  +  av 


D2, 


a5  +  a2  D2 


-32- 


In  order  to  illustrate  the  Viterbi  algorithm  as  applied  to  the  pruned 
error  trellis,  suppose  the  decoder  has  reached  frame  4.  The  output  of  the 

3 

transition  from  state  a  to  state  0  is 

C°3f  [u(D)  6(D)  +  z(D)  R(D)] 

.[<*«/]♦[«».  «<*].[o.o] 

with  Hamming  weight  0.  A  similar  calculation  for  the  other  seven  possible 

3 

transitions  shows  that  the  transition  from  a  to  0  is  the  only  one  with 

3 

Hamming  weight  0.  The  path  segment  from  a  to  0  is  chosen  since  it  has 
minimum  weight. 

At  frame  5  in  Fig.  3,  the  minimum  weight  estimate  of  the  D-transform  of 


the  error  sequence  is 


£(D)  -  [o,  a]  +  [l,  o]  D2  . 


Hence,  the  estimate  £(D)  of  the  message  correction  factor  which  achieves  e(D) 


u^(D)  =  a3  +  a3  D2  . 

Finally,  using  the  above  results  in  Eq.  (19)  yields,  by  Table  1, 


x  =  z(D)  G"1  -  u(D)  =  z(D)  B”1 


=  z(D) 


q3,  a2  +  a3  ol  f  1 


a2,  a2  +  a2  D  L  0 


-u(D) 


u(D)  =  z(D)  -  u(D) 
~2 


=[,ta5D+D2-“5+a2D2]Ut(a3+a2tD2) 

*(a3+aD+a3  °2*l-4»2)+(a3  +  a^)  =  ,+aD, 

v  v  .  v.\  v-  v 


-33- 


the  original  encoded  message. 


V.  CONCLUDING  REMARKS 

In  this  report,  pruned  error-trellis  decoding  of  systematic  and  non- 
systematic  convolutional  codes  with  a  delay-free  inverse  has  been  developed 
in  detail,  including  quantitative  formulas  for  the  number  of  states  and 
transitions  which  remain  in  the  pruned  error  trellis.  Currently,  the  prob¬ 
lem  of  trellis  pruning  of  other  non-systematic  CCs  is  being  investigated. 
Finally,  the  reduced  hardware  requirements  for  pruned  error-trellis  decoding 
versus  standard  Viterbi  decoding  is  being  studied,  and  a  preliminary  archi¬ 
tecture  has  already  been  found  for  the  dual-K  decoding  algorithm  developed 
in  this  report.  '  ‘‘  \ 
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